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Abstract 

Background: The Oncofype DX® Prostate Cancer Assay is a multi-gene RT-PCR expression assay that was developed 
for use with fixed paraffin-embedded (FPE) diagnostic prostate needle biopsies containing as little as 1 mm of 
prostate tumor in the greatest dimension. The assay measures expression of 12 cancer-related genes representing 
four biological pathways and 5 reference genes which are algorithmically combined to calculate the Genomic 
Prostate Score (GPS). This biopsy-based assay has been analytically and subsequently clinically validated as a 
predictor of aggressive prostate cancer. The aim of this study was to validate the analytical performance of the 
Oncofype DX Prostate Cancer Assay using predefined acceptance criteria. 

Results: The lowest quartile of RNA yields from prostate needle biopsies (six 5 |jm sections) was between 19 and 
34 ng. Analytical validation of the process requiring as little as 5 ng of RNA met all pre-defined acceptance criteria. 
Amplification efficiencies, analytical sensitivity, and accuracy of gene assays were measured by serially diluting an 
RNA sample and analyzing features of the linear regression between RNA expression measured by the crossing 
point (Cp) versus the log2 of the RNA input per PGR assay well. Gene assays were shown to accurately measure 
expression over a wide range of inputs (from as low as 0.005 ng to 320 ng). Analytical accuracy was excellent with 
average biases at qPCR inputs representative of patient samples <9.7% across all assays while amplification 
efficiencies were within ±6% of the median. Assessments of reproducibility and precision were performed by 
testing 10 prostate cancer RNA samples over multiple instruments, reagent lots, operators, days (precision), and RNA 
input levels (reproducibility) using appropriately parameterized linear mixed models. The standard deviations for 
analytical precision and reproducibility were 1.86 and 2.11 GPS units (100-unit scale) respectively. 

Conclusions: The Oncofype DX Prostate Cancer Assay, a clinical RT-PCR assay specifically designed for use with 
prostate needle biopsies, has been analytically validated using very limited RNA inputs. The assay requirements and 
analytical performance will provide physicians with test results from a robust and reliable assay which will enable 
improved treatment decisions for men diagnosed with early-stage prostate cancer. 
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Background 

Prostate cancer is one of the most common malignan- 
cies in men with an estimated 250,000 new cases being 
diagnosed in the US in 2012, mostly as a consequence of 
PSA screening [1]. Given that most newly diagnosed 
cases represent low-risk disease, with less that 3% of 
men dying of prostate cancer, active surveillance rather 
than immediate treatment has been broadly endorsed by 
practice guidelines to be a valid treatment option for 
many men. Nonetheless, a large majority of men with 
low-risk, early stage disease still undergo aggressive 
intervention with radical prostatectomy and/or radiation 
therapy [2], despite their attendant long-term side effects 
and cost, in large part because clinicians are uncertain as 
to the accuracy of conventional methods for discrimina- 
ting low-risk from high-risk disease. A number of risk 
assessment tools (e.g. nomograms), based on clinical 
and pathologic features such as serum PSA level, 
clinical stage and biopsy Gleason Score, are currently 
being used for risk stratification of men with early- 
stage prostate cancer [3]. Although these tools have 
some predictive value, a substantial fraction of men 
expected to have low-risk disease are found to have 
more aggressive disease at prostatectomy [4]. Clearly 
there is a need for more accurate discrimination of 
low risk disease from aggressive prostate cancers at 
the time of diagnosis. 

There is a growing recognition that molecular 
biomarkers can complement conventional clinical and 
pathologic parameters to personalize the care of cancer 
patients. However, incorporation of biomarkers into 
standard clinical practice requires a level of validation 
which is not often achieved. In order for a biomarker to 
enter wide clinical practice it needs to demonstrate evi- 
dence of strong analytical validity, clinical validity, and 
clinical utility [5]. Indeed, successful clinical validation 
may be challenging unless the biomarker is based on a 
robust, analytically validated platform. Genomic Health, 
Inc. has developed a family of analytically [6,7] and 
clinically-validated, multi-gene real-time polymerase 
chain reaction (RT-PCR) assays (Oncotype DX' Assays) 
which identify underlying biology in an individual pa- 
tient's tumor to help guide treatment decisions in inva- 
sive node negative and node positive breast cancer 
[8-11], breast ductal carcinoma in situ [12] and stage 
II/III colon cancer [13-15]. Since its introduction in 
2004, the Oncotype DX Breast Cancer Assay has become 
widely used in standard clinical practice, and is incorpo- 
rated into the major oncology practice guidelines, in- 
cluding NCCN and ASCO [16,17]. 

In order to impact clinical decision-making at the time 
of diagnosis, the Oncotype DX Prostate Cancer Assay 
was specifically designed for analysis of fixed, paraffin- 
embedded (FPE) prostate needle biopsy tissue. Some of 



the key challenges in developing this biopsy-based assay 
for prostate cancer include the heterogeneous and multi- 
focal nature of the disease, and the very small amounts 
of tumor tissue available from diagnostic prostate needle 
biopsies. Most newly diagnosed prostate cancer patients 
harbor low-volume disease that may be diagnosed 
from a single biopsy core [18]. A series of analytical 
studies were performed to optimize sensitivity of the 
analytical component of the Oncotype DX Prostate 
Cancer Assay platform and enable processing of small 
volume tissue from prostate biopsies. Some of the most 
notable changes include addition of a multiplexed 
preamplification step after reverse transcription and 
combining the genomic DNA detection step with the 
quantitative PCR step. Considering that the RNA yields 
of many samples cannot be reliably measured with 
current methods, the sample quality is determined pri- 
marily by the expression of reference genes rather than 
mass as a measure of the amplifiable RNA. The analyt- 
ical portion of the Oncotype DX Prostate Cancer Assay 
was designed to target 20 ng as the nominal input, 
however the assay accommodates samples with lower 
concentration through variable RNA inputs, thus enab- 
ling the assay to perform well with RNA inputs which 
are 110-180 fold lower than required by the Oncotype 
DX Colon and Breast Cancer Assays [6,7]. The Oncotype 
DX Prostate Cancer Assay includes 5 reference genes 
and 12 cancer genes representing distinct biological 
pathways with a known role in prostate tumorigenesis: 
the androgen pathway {AZGPl, KLK2, SRD5A2, and 
FAM13Q, cellular organization {FLNC, GSN, TPM2, and 
GSTM2), proliferation (TPX2), and stromal response 
(BGN, COLlAl, and SFRP4). Reference gene norma- 
lization is used to control for sources of pre-analytical 
and analytical variability as well as allow for variable 
RNA inputs. Reference normalized expression of the 12 
cancer-related genes are used to calculate the Genomic 
Prostate Score (GPS), which has been shown to predict 
adverse prostate cancer pathology beyond conventional 
clinical/pathologic factors in a recently completed 
clinical validation study [19]. 

Prior to initiation of the clinical validation study, 
analytical validation studies, with pre-specified end- 
points and acceptance criteria, were conducted to 
ensure that the analytical component of the Oncotype 
DX Prostate Cancer Assay is well-controlled and pro- 
duces reliable assessment of RNA from individual 
FPE prostate tumor tissue. These studies demon- 
strated analytical validity of the gene assays com- 
prising the Oncotype DX Prostate Cancer Assay 
with respect to analytical sensitivity, bias, amplifica- 
tion efficiency, precision and reproducibility, and also 
validated the precision and reproducibility of the 
resulting GPS. 
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Methods 

Tumor blocks and samples 

FPE neecQe biopsies were provided by the Cleveland Clinic 
Pathology and Laboratory Medicine Institute and Glickman 
Urological and Kidney Institute (C. M-G., E.A.K., S.M.F.) 
and were centrally reviewed by two pathologists (C.M-G 
and S.M.F). The use of the samples has been approved by 
the Cleveland Clinic Institutional Review Board. All biop- 
sies were sectioned into eight 5 |im sections. The top and 
the bottom slides were hematoxylin and eosin (H&E) 
stained and the presence of tumor was confirmed by a 
board-certified pathologist. The tumor area was marked in 
the top H&E slide and the markings were transferred to the 
subsequent 6 unstained slides. The tumor was manually 
dissected from each unstained slide using a scalpel 
blade and transferred into a microcentrifuge tube. FPE 
prostate cancer samples from radical prostatectomies 
were purchased from ProteoGenex (Culver City, CA). The 
needle biopsy tumors were microdissected to exclude 
normal-appearing tissue. The selected radical prostatec- 
tomy blocks spanned a wide range of gene expression and 
range of GPS. Gleason score was assigned using the 2005 
International Society of Urological Pathology Consensus 
guidelines [20]. 

RNA extraction 

Paraffin from FPE samples was solubilized by Shandon 
Xylene Substitute (Thermo Fisher Scientific, Waltham, 
MA) and removed. Tissue was lysed and protein was 
digested with Proteinase K (SOOjig; Beckman Coulter, 
Beverly, MA). Nucleic acids were bound to paramagnetic 
beads using the Agencourt" FormaPure Kit (Beckman 
Coulter, Beverly, MA) and manipulated using Tecan" li- 
quid handling robots (Tecan, Mannedorf, Switzerland) 
with an integrated KingFisher" Flex magnetic particle 
processor (Thermo Fisher Scientific, Waltham, MA). 
DNA was digested with 200 units of DNasel (Promega, 
Madison, WI). Purified RNA was released from the 
paramagnetic beads and suspended in water. 

RNA quantitation 

RNA was quantified using the RiboGreen" fluorescence 
method (Invitrogen, Carlsbad, CA) according to the 
manufacturer's instructions. The limit of quantitation for 
the assay was 0.5 ng/ \iL. 

Reverse transcription 

RNA (up to 20 ng) was converted to complementary 
DNA (cDNA) by combining the Omniscript" RT kit 
(Qiagen, Valencia, CA) and a specific reverse primer for 
each gene assay using Tecan liquid handling robots. All 
primers (final concentration 50 nmol/L) were purchased 
from Integrated DNA Technologies (Coralville, lA). The 



RT reaction (30 |iL) was incubated at 37°C for 60 
minutes and then inactivated at 93°C for 5 minutes. 



Preamplification 

The cDNA was preamplified using custom TaqMan" Pre- 
Amp Master Mix (Life Technologies, Carlsbad, CA) and 
custom forward and reverse primers (55 nM final concen- 
tration) for each target gene, including the control assay 
for genomic DNA (gDNA) detection. The reactions were 
assembled using Tecan liquid handling robots, placed in a 
thermocycler (BioRad, Hercules, CA) and incubated under 
the following conditions: 95°C for 10 min followed by 8 
cycles of 95°C for 15 sec and 60°C for 4 min. 

qPCR and genomic DNA detection 

The amplified product was mixed with the forward and 
reverse primers and probes (Black Hole Quencher-2, 
Integrated DNA Technologies) for each of the gene as- 
says and for the gDNA detection assay {ARFl_promoter, 
designed to amplify a promoter region of ARFl) using 
Tecan liquid handling robots. QuantiTect" Primer Assay 
master mix (Qiagen, Valencia, CA) was used, and the 
5 \iL reaction was amplified for 45 cycles in a LightCycler" 
480 (Roche Applied Science, Indianapolis, IN) under the 
following conditions: enzyme activation (95°C, 15 min), 
amplification (95°C for 20 sec and 60°C for 45 sec; 45 cy- 
cles in total), cooling (40°C, 5 sec). The level of expression 
was calculated with the crossing point (Cp) method 
implemented by the Roche LightCycler 480 software ver- 
sion 1.5. All gene assays were measured in triplicate and 
required at least 2 valid wells. Oligonucleotide and 
amplicon sequences can be found in the Additional file 1 
and Additional file 2, respectively. 

Reference gene normalization and Genomic Prostate 
Score calculation 

The five reference genes (ARFl, ATPSE, CLTC, GPSl 
and PGKl) were selected in an independent study for 
their low inter-patient variability, lack of relationship to 
clinical outcome and robust analytical performance (data 
not shown). Reference gene normalization was used to 
adjust for potential sources of pre-analytical variability, 
such as fixation, RNA fragmentation and tissue quality, 
analytical variation including assay plate (RT, qPCR) and 
instrumentation (liquid handler or LC480), and varying 
RNA inputs (with 20 ng as the target input). Gene ex- 
pression was normalized by subtracting the aggregated 
expression of the reference genes from the weighted 
mean Cp for each of the 12 informative genes and cen- 
tering by adding 10 units to the result. The formula used 
to calculate the GPS is shown in Figure 1. 
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GPSu=0.735 X Stromal Group - 0.368 x Cellular Organization Group - 0.352 x Androgen group + 0.095 
X Proliferation 



Where 



SRD5A2 Thresh = 



(5.5 



TPX2 Thresh = 



ifSRD5A2<5.5 



(SRD5A2 otherwise 
(5.0 ifTPX2<5.0 
(tPX2 otherwise 



Stromal Group Score= 0.527 x BGN + 0.457 x COLlAl + 0.156 x SFRP4 

Cellular Organization Group Score = 0.163 x FLNC + 0.504 x GSN + 0.421 x TPM2 + 0.394 x 
GSTM2 

Androgen Group Score = 0.634 x FAM13C + 1 .079 x KLK2 + 0.642 x AZGPl + 0.997 x SRD5A2 
Thresh 

Proliferation Score = TPX2 Thresh 



The scaled GPS is then calculated as: 



GPS= <^ 



0 

100 

13.4x(GPS„ - 10.5) 



ifl3.4x(GPS„ -10.5)<0 
if 13.4x(GPS„ - 10.5) > 100 
otherwise 



Figure 1 Calculation of Genomic Prostate Score (GPS). The aggregate expression of 5 reference genes was used to reference normalize tlie 
expression of tlie 12 cancer-related genes. Normalized gene expression was used to calculate the individual group scores: stromal group score, 
cellular organization score, androgen groups score and proliferation score. Each of those group scores is algorithmically combined to calculate 
the unsealed Genomic Prostate Score (GPSu); the GPSu is then scaled to a 100-unit range GPS. A negative coefficient in the calculation of the 
GPS is associated with better outcome whereas a positive coefficient is associated with poorer outcome. 



Linearity 

Linearity was assessed by serially dOuting RNA and analyz- 
ing gene expression as a function of 14 input concentra- 
tions of RNA extracted from FPE prostate tumor tissue. 
Two different starting RNA concentrations were used 
(1,638 ng/well and 204 ng/well) where the higher starting 
concentration was used for the genes with the lowest ex- 
pression levels {TPX2 and SRD5A2), and the lower starting 
concentration was used for all other genes. For each 
starting concentration, three independent series of two-fold 
serial dilutions were prepared using a Tecan liquid handling 
robot. The serially diluted RNA was converted to cDNA, 
preamplified for 8 cycles of PGR and taken through 45 cy- 
cles of qPCR. The linearity of signal response between the 
measured gene expression (Cp) and the log base 2 of RNA 
concentration was evaluated for each of the individual gene 
assays. In total, for each gene and RNA concentration, 9- 
replicate PGR measurements were recorded and analyzed. 



Linearity was assessed for each gene using the polyno- 
mial method developed by Krouwer et al. [21]. The poly- 
nomial method utilized in these analyses first fit a 
quadratic and cubic regression models and compares, 
through tests of significance on the quadratic and/or 
cubic terms, whether the extra linear models are better 
fits to the data than a linear regression. Heteroscedas- 
ticity of the error variance was modeled as a log-linear 
function of (known) log input concentration. These tests 
were used to determine the presence of significant non- 
linearity (at a 0.05 alpha level). If significant non- 
linearity was detected, then the predicted Cp value from 
the best fitting polynomial model was compared with 
the corresponding predicted Gp value from the linear 
model. This difference in predicted Gp values (deviation 
from linearity, (DL)) was calculated at each input 
concentration. A maximum DL across concentrations 
of < 1 Gp was used to determine that a gene is 
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sufficiently linear over the concentrations studied. For a 
given concentration, if the DL was greater than or equal 
to 1 Cp, then that concentration was removed from the 
linear range until all concentrations met the DL criteria, 
thereby defining the limit of quantitation (LOQ). 
Additionally, to control large residual variation at low 
concentrations, which is a consequence of pream- 
plification of small starting amounts of cDNA, an 
additional criterion was applied where concentrations 
below and including the point at which the residual 
standard deviation was > 1.5 Cp were removed from 
the linear range, and polynomial model assessments 
were repeated. 

Amplification efficiency 

Estimates of amplification efficiencies were obtained for 
all gene assays and are provided by the formula: 

Efficiency = 100 x (2-^/''^°p<=-iJ 

where the slope is estimated from the regression of Cp 
measurements versus (known) log base 2 RNA concen- 
tration described above. 

Analytical accuracy 

The analytical accuracy of predicted RNA concentrations 
relative to the (known) input RNA concentrations was 
estimated for each of the 17 genes separately. Specific- 
ally, for every evaluable Cp measurement, an inverse 
prediction of RNA concentration was derived from the 
final linear model: 

y= ^o + ^ilog2(:«) 

An estimate of the accuracy of the assay was given by 
the mean percent bias in prediction at each known RNA 
concentration k {k = 1...14); namely, by 

3 mi 

Biasli = 

where J is the (known) input RNA concentration for the 
k"^ level and Xijk was the predicted RNA concentration 
obtained from inverse prediction for the i* plate 
(i=l,2,3), j* well (j=l,2,3) and k* RNA concentration 
level, rrii was the number of valid wells on the i"^ plate, 
and m was the total number of valid wells across all 
plates. Analyses were performed using restricted max- 
imum likelihood estimation (REML) in the PROC 
MIXED procedure of SAS version 9.2. 



Analytical sensitivity 

The analytical sensitivity of each of the gene assays 
was evaluated separately. Specifically, for each gene a 
nonlinear mixed effects model with log-linear variance 
function or unstructured covariance was used to model 
the expected heteroscedasticity in intra-assay response as 
a function of RNA concentration. This information was 
used to estimate the Limit of Detection (LOD) and LOQ 
of the assay on the Cp scale. For each of the 17 gene 
assays, the LOD was estimated by a lower 95% confidence 
bound on the mean expression at zero concentration (nat- 
ural original scale). Similarly, the LOQ on the Cp scale 
was estimated by the upper one-sided 95% confidence 
interval of the fitted (linear) model at the inverse predic- 
tion of ymin or at the minimum concentration fulfilling the 
linear range criteria if the inverse prediction of ymin was 
less than the lower linear range limit (on the concentra- 
tion scale). Analyses were performed using an iterative 
estimation scheme involving the PROC MIXED and the 
PROC NLMIXED procedure in SAS version 9.2. 

Analytical precision and reproducibility 

Data for the precision and reproducibility were gener- 
ated from 10 different FPE prostate cancer samples that 
were run 9 times each: 3 replicates at each of 3 different 
RNA inputs (5 ng, 10 ng, and 20 ng). Both precision and 
reproducibility were generated using multiple lots of RT- 
PCR reagents, multiple PCR instruments and multiple 
Tecan liquid-handling robots. The precision component 
for each of the 12 informative genes as well as the GPS 
was defined as the within RNA input standard deviation 
captured in the residual variation term from the linear 
mixed model described below: 

Yijk = ^I + «! + Ay+Eijk 

Where Y^\^ was the outcome measurement (either GPS 
or reference normalized gene expression), ^ was the 
overall mean effect, ai was the random effect of the i''* 
sample (i=l,2,..,10), distributed N(0,a^), \ was the fixed 
effect of the j"^ RNA Input (j=l,2,3), Eijk was the residual 
error ~ N(0, o^) where k indexes replicate (k=l,2,3), 
assumed independent of «[. 

The reproducibility component for each of the 12 
informative genes as well as the GPS was defined as 
the within and between RNA input standard deviation, 
captured in the residual variation term from the linear 
mixed model described below: 

Yijk = 1^ + "i+eijk 

Where Y^k was the outcome measurement (either GPS 
or reference normalized gene expression), ^ was the 
overall mean effect, ai was the random effect of the i* 
sample (i=l,2,..,10), distributed N(0,o^), eijk was the 
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residual error ~ N(0,af) where k indexes replicate 
(k=l,2,3), assumed independent of ai. 

Assay controls 

Each Oncotype DX Prostate Cancer Assay RT and 
preamplification assay plate has 5 wells containing pro- 
cessing controls: RNA extracted from prostate 
cancer FPE (a positive control for reverse transcription, 
preamplification and qPCR), human genomic DNA 
(a positive control for gDNA detection, Promega, Madison, 
WI), and 3 wells containing nuclease-free water (a negative 
control for contamination). In addition, all qPCR plates 
have 12 wells of positive (gDNA) and 12 wells of negative 
(nuclease-free water) controls. 



length X 0.75 mm biopsy core width x 0.030 mm tumor 
depth (based on availability of six 5 \im sections)) and 
sought to investigate distribution of RNA yields in such 
samples. In two separate studies [22,23], RNA was 
extracted from 46 and 167 biopsies. Using this data, the 
average RNA yields in the targeted tumor volume were 
estimated to be less than 50 ng and the lowest quartile 
was estimated to contain between 19 and 34 ng of RNA 
(Figure 2a and 2b). Considering variability in clinical 
practice, it is expected that biopsies from men with pros- 
tate cancer will be encountered which contain less than 
this amount of RNA; as such, we targeted an RNA input 
for the assay below the lower limit of the lowest quartile, 
in order to meet the needs of all patients. 



Results 

Distribution of RNA yields In the smallest diagnostic 
prostate biopsies 

We estimated that the smallest diagnostic biopsies would 
contain 0.0225 mm'' of tumor volume (1 mm tumor 



Amplification efficiency 

In order for normalization and multianalyte GPS result 
calculation to be accurate, amplification efficiencies of 
individual gene assays were required to be similar [24], 
defined as the median ± 20%. The median amplification 
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Figure 2 Estimates of RIMA yields in the smallest diagnostic biopsies, (a) RNA from 46 formalin-fixed prostate needle core biopsy specimens 
collected between 2003 and 2008 by the Cleveland Clinic was extracted and RNA yields were measured using the RiboGreen method. Tumor 
volumes were measured and all results were adjusted to represent RNA yields in the target tumor volume of 0.0225 mm^ (1 mm tumor length x 
0.75 mm width of diagnostic biopsies x 0.030 mm tumor depth), (b) AUA low and intermediate risk patients treated with radical prostatectomy at 
Cleveland Clinic between 1999 and 2007 with available biopsy tissues were included in the study. In total, RNA from 157 diagnostic biopsies was 
extracted and RNA yields were measured using the RiboGreen method. Tumor volumes were measured and all results were adjusted to represent 
RNA yields in the target tumor volume of 0.0225 mm^ (1 mm tumor length x 0.75 mm width of diagnostic biopsies x 0.030 mm tumor depth). 
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efficiency for the 17 gene assays was 93.9% and all effi- 
ciencies were within ± 6.3% of that value, demonstrating 
that gene-specific assay performance was comparable for 
all assays. Amplification efficiencies ranged from 88% 
(AZGPl) to 100% {TPM2). Amplification efficiencies of 
individual assays, including 95% confidence intervals are 
shown in Table 1. 

Analytical sensitivity 

For a multianalyte assay to be clinically useful, it must also 
be able to distinguish between signal and background non- 
specific signal. All gene assays had LODs that surpassed 
the pre-specified Cp criteria of 35. Moreover, to make ac- 
curate expression measurements, LOQs need to be in a 
range of Cp values that are higher than the estimates for 
gene expression within the intended specimen type. Ac- 
ceptance criteria were pre-specified for each gene assay 
separately, and aU were met. Most of the assays had esti- 
mated LOQs between Cp=33 and Cp=35 effectively indi- 
cating that most individual assays can accurately quantitate 
several copies of RNA. The lowest estimated LOQ of 29.4 
was for the KLK2 assay. Considering that this value was 
established at its final dilution point, it is possible that the 
true LOQ is higher. However, even with this estimated 
LOQ, the discriminative capability of the KLK2 assay is ap- 
propriate for the expected population distribution (median 



Table 1 Amplification efficiencies of individual gene 
assays in the Oncotype DX Prostate Cancer Assay 
reporting the GPS 



Gene Assay 


Estimated Amplification 
Efficiency 
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Estimated amplification efficiencies and 95% confidence Inten/als for the 17 
genes comprising the GPS derived from a 15-polnt dilution series. 



population expression level has been estimated to be ap- 
proximately Cp=20). Table 2 shows data for the estimated 
LOQ and LOD for each gene assay. 

Linear range 

FPE RNA was serially diluted two-fold and performance 
of the individual gene assays was assessed over a linear 
range of 14 RNA concentrations. Table 3 lists the linear 
ranges for all gene assays, which ranged from 10 log 
units {FLNC, GSTM2) to the full 14 log units (16,384- 
fold) RNA concentrations studied (ARFl, BGN, CLTC, 
COLlAl, KLK2, TPM2). The smallest amounts that 
could be accurately quantified ranged from 0.025 ng to 
1.6 ng RNA per PCR well. These results demonstrate 
the performance of these gene assays to quantify cDNA 
over a wide range of input concentrations that can be 
encountered in small prostate needle biopsies. 

Analytical accuracy 

For all 17 GPS gene-specific assays, the absolute value of 
the average percent accuracy at the nominal, one-half 
and the one-quarter qPCR input was required to be less 
than 25% in order to meet pre-specified criteria. All as- 
says met these criteria and the results for accuracy are 
listed in Table 4. The largest analytical bias was observed 
for the ATP5E assay (9.7%) and all other assays displayed 
a bias smaller than 5% in absolute value. Both the ampli- 
fication efficiency and the accuracy at the individual 



Table 2 Limits of quantitation and detection 



Gene Assay 


Est. LOQ (Cp) 


Est. LOD (Cp) 


ARFl 


334 


38.9 


ATP5E 


33.6 


40.0 


AZGPl 


34.7 


40.0 


BGN 


35.3 


36.3 


CLTC 


34.3 


40.0 


COLIAI 


33.2 


37.0 


FAM13C1 


35.3 


40.0 


FLNC 


32.7 


40.0 


GPS1 


33.5 


40.0 


GSN 


334 


40.0 


GSTM2 


33.2 


40.0 


KLK2 


294 


40.0 


PGKl 


354 


40.0 


SFRP4 


34.8 


40.0 


SRD5A2 


33.5 


36.6 


TPM2 


334 


40.0 


TPX2 


344 


36.3 



Analytical sensitivity (estinnated Limits of Quantitation and Limits of Detection) 
for each of the 17 genes comprising the GPS derived from the 15-point 
dilution series. 
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Table 3 Linear range 

Gene Linear Range Quadratic Cubic IVIaximum Absolute 



Assdy 


(ng/well) 


p-valuG 


p-value 


Deviation from 
Linearity 


ARFl 


2-" to 2" 


<.0001 


0.1191 


0.37 


ATP5E 


2 '' ' to 2" 


<.0001 


0.0418 


0.82 


AZGP1 


2 '' ' to 2" 


0.0006 


0.1143 


0.46 


BGN 


2-" to 2" 


0.0030 


0.4064 


0.57 


CLTC 


2-53 2" 


0.0145 


0.2198 


0.28 


C0L1A1 


2-" to 2" 


0.2406 


0.0268 


0.31 


FAM13C1 


J'-' to 2" 


0.9180 


0.0026 


0.81 


FLNC 


2-13 2" 


0.0002 


0.0001 


0.17 


GPSl 


2-^-^ to 2'-' 


0.3704 


<.0001 


0.40 


GSN 


2-" to 2" 


<.0001 


0.8081 


0.57 


GSTM2 


J'-' to 2" 


0.2461 


0.0009 


0.23 


KLK2 


2-" to 2" 


0.3600 


0.0008 


0.16 


PGKl 


2-^-^ to 2" 


0.0224 


0.5250 


0.30 


SFRP4 


2-^-^ to 2" 


0.5033 


0.1019 


N/A 


SRD5A2 


20.7 2"'-'' 


0.0002 


0.1016 


0.34 


TPM2 


2 " to 2" 


<.0001 


0.2190 


0.43 


TPX2 


207 -,,0.7 


0.5396 


0.5799 


N/A 



N/A: No statistically significant departure from linearity. 
Estimated linear ranges, associated p-values for quadratic or cubic regression 
models and maximum deviations from linearity for each of the 17 genes. 
The results were calculated using the data obtained from the 15-point 
dilution series. 

concentrations are measures of the assay's ability to de- 
tect the doubling in the amount of RNA for individual 
genes in 1 Cp increments. The combination of low mea- 
sures of biases and amplification efficiencies approaching 
100% demonstrated robust analytical performance at 
RNA inputs expected to be found in prostate biopsies. 

Precision and reproducibility 

Precision was evaluated by examining variation within the 
same RNA input, and reproducibility incorporated both 
within and between RNA input variation. Pre-specified ac- 
ceptance criteria were set for the average within RNA input 
standard deviation pooled between 20 ng, 10 ng and 5 ng 
RNA input (precision) and the within and between RNA 
input (20 ng, 10 ng and 5 ng) standard deviation (reprodu- 
cibility). The lowest input of 5 ng was chosen to mirror the 
lowest 2.5 percentile of the 0.0225 mm tumors. A total of 
10 blocks were chosen to span a wide range of GPS results. 
Each RNA extract was processed three separate times at 
three different input levels (5 ng, 10 ng and 20 ng) by sev- 
eral operators over a two week period. Multiple lots of oli- 
gonucleotides, preamplification master mixes, and qPCR 
master mixes were used and processed on different Tecan 
liquid-handling robots and LightCycler 480 instruments. 
The standard deviation for precision was 0.21 Cp or 



Table 4 Analytical accuracy 


Gene Assay 


Average % Bias 


ARFl 


3.0 


ATP5E 


9.7 


AZGP1 


1.8 


BGN 


3.6 


CLTC 


-0.3 


C0L1A1 


1.1 


FAM13C1 


-0.8 


FLNC 


1.2 


GPSl 


4.9 


GSN 


4.8 


GSTM2 


3.1 


KLK2 


2.2 


PGKl 


1.9 


SFRP4 


3.9 


SRD5A2 


-1.9 


TPM2 


1.0 


TPX2 


-2.0 



The estimated analytical accuracy of predicted RNA concentrations relative to 
the known input RNA concentrations for each of the 1 7 Oncotype DX Prostate 
Cancer Assay gene assays; the average % bias is taken over the nominal, 
V2 and 'A qPCR input levels. 



smaller for individual gene assays. For GPS, standard devi- 
ation was 1.86 units on the 100-unit scale. The standard 
deviation for reproducibility was also 0.21 Cp or smaller 
for all assays. Reproducibility variation for the GPS was 
2.11 units on the 100-unit scale. Tables 5 and 6 show re- 
sults for the gene assays and the GPS for precision and re- 
producibility, respectively. AU precision and reproducibility 
measurements met pre-specified acceptance criteria thus 
demonstrating a robust analytical performance. These re- 
sults establish that average of the reference genes can be 
used to enable processing of variable RNA inputs while 
maintaining high precision and reproducibility. 

Controls 

The box plots presented in Figure 3 illustrate the per- 
formance of the RT-PCR positive control across 18 con- 
trol plates (54 individual Cp measurements per gene) 
run during analytical validation and the subsequent clin- 
ical validation. Table 7 lists the summary statistics, in- 
cluding standard deviations which were calculated on 
the non-normalized scale. These plates were run on 
multiple PGR and liquid handling robots by multiple op- 
erators, using multiple lots of critical reagents spanning 
a period of 20 weeks. The performance illustrates a 
process that is well in control with overall very low vari- 
ation (the largest SD was 0.33 Cp). Figure 4 presents box 
plots of the gDNA and qPCR positive control, which are 
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Table 5 Analytical precision for normalized gene 
expression and GPS 



Gene Assay / Precision SD reference 
Score normalized Cp 


LOWci yjvo 

CL, Cp 


upper yjvo 

CL, Cp 


ARFl 


0.08 


0.07 


0.10 


ATP5E 


0.1 5 


0.13 


0.18 


AZGPl 


0.14 


0.12 


0.16 


BGN 


0.1 1 


0.09 


0.13 


CLTC 


0.07 


0.06 


0.09 


C0L1 Al 


0.10 


0.08 


0.11 


hAMl JLl 


0.10 


0.09 


0.12 


FLNC 


0.1 1 


0.09 


0.13 


GPSl 


0.08 


0.07 


0.09 


GSN 


0.10 


0.09 


0.12 


GSTM2 


0.13 


0.11 


0.15 


KLK2 


0.08 


0.07 


0.09 


PGK1 


0.08 


0.07 


0.10 


SFRP4 


0.11 


0.10 


0.13 


SRD5A2 


0.20 


0.17 


0.23 


TPM2 


0.10 


0.09 


0.12 


TPX2 


0.21 


0.18 


0.25 


GPS 


1.86 


1.60 


2.20 



Analytical precision (assay variation within a given RNA input level ) of the 17 
Oncofype DX prostate gene assays and GPS was derived from 10 different FPE 
prostate cancer samples that were run 9 times each: 3 replicates at each of 3 
different RNA inputs: 5 ng, 10 ng and 20 ng (target). The within RNA input 
level sources of variation include instruments (qPCR and Tecan liquid handling 
robots}, reagents lots (oligonucleotides, preamplification master mix and qPCR 
master mix), operator and time (processing spanned 2 calendar weeks). 



also reflective of good assay performance due to their 
low variability. 

Discussion 

Prostate cancer is one of the most common malignan- 
cies in adult males and is typically diagnosed using small 
needle core biopsies. While many men with newly diag- 
nosed prostate cancer harbor indolent disease, most are 
treated with immediate surgery or radiation. A molecu- 
lar assay capable of obtaining reliable, clinically relevant 
genomic information from small amounts of tumor tis- 
sue available in diagnostic core biopsies could help more 
accurately identify men with low risk of clinical progres- 
sion who could be managed by active surveillance. In 
this manuscript we demonstrate that the Oncotype DX 
Prostate Cancer Assay can accurately and reproducibly 
quantitate gene expression at RNA input levels 110-180 
fold less [6,7] than the other Oncotype DX assays. 

The Oncotype DX Prostate Cancer Assay is a multi- 
gene RT-PCR assay specifically designed to analyze under- 
lying tumor biology in tumor tissue from diagnostic FPE 



Table 6 Analytical reproducibility for normalized gene 
expression and GPS 



Gene Assay / Reproducibility SD, reference Lower Upper 
Score normalized Cp 95% CL 95% CL 



ARFl 


0.09 


0.08 


0.1 1 


ATP5E 


0.1 7 


0.15 


0.20 


AZbr 


0.1 5 


0.13 


0.18 


BGN 


0.1 1 


0.09 


0.13 


LL 1 L 


0.07 


U.Ud 


n An 

u.uy 


COLl Al 


0.1 0 


0.09 


0.12 


FAM1 3L1 


0.1 1 


0.09 


0.12 


CI 


0.1 1 


0.10 


0.13 


GPSl 


0 08 


007 


0.09 


GSN 


0.14 


0.12 


0.16 


GSTM2 


0.14 


0.12 


0.16 


KLK2 


0.08 


0.07 


0.09 


PGK1 


0.10 


0.09 


0.12 


SFRP4 


0.12 


0.10 


0.14 


SRD5A2 


0.20 


0.17 


0.23 


TPM2 


0.10 


0.09 


0.12 


TPX2 


0.21 


0.18 


0.25 


GPS 


2.11 


1.83 


2.50 



Analytical reproducibility (assay variation incorporating both between RNA 
input level and within RNA input level variation) of the 17 Oncofype DX 
prostate gene assays and GPS was derived from 10 different FPE prostate 
cancer samples that were run 9 times each: 3 replicates at each of 3 different 
RNA inputs: 5 ng, 10 ng and 20 ng (target). The within RNA input level sources 
of variation include instruments (qPCR and Tecan liquid handling robots), 
reagents lots (oligonucleotides, preamplification master mix and qPCR master 
mix), operator and time (processing spanned 2 calendar weeks). 



core needle biopsies. In order for a biomarker to enter 
wide clinical practice, evidence of strong analytical validity 
needs to be demonstrated [5]. Prior analytical validation 
studies performed by Genomic Health, Inc. have recendy 
been cited as a model of how such validation studies 
should be performed [25] and a similar approach to ana- 
lytical validation was used for the Oncotype DX Prostate 
Cancer Assay. All pre-specified analytical criteria (i.e. 
amplification efficiency, analytical sensitivity, bias, repro- 
ducibility and precision) were met, and the analytical assay 
has been demonstrated to be reliable for clinical use even 
for patients from whom only limiting amounts of biopsy 
material are available. 

Prostate cancer is often diagnosed from a single positive 
diagnostic biopsy containing no more than 1 mm of tumor 
in the greatest dimension. Data suggested that the lowest 
quartUe of such biopsies with 30 microns of available tissue 
would contain between 19 and 34 ng of RNA. In order to 
enable accurate and reproducible quantitation of gene ex- 
pression when limiting copies of target RNA are present, a 
number of analytical changes to the Oncotype DX platform 
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Gene 

Figure 3 Boxplots for the RT-PCR positive control during analytical and clinical validation studies. Boxplots summarizing performance of 
eacli gene over analytical and clinical validation for RT-PCR positive control (prostate cancer FPE pool) representing 18 RT-PCR control plates 
stratified by gene (ARF1, ATP5E, CLTC, GPS1 and PGK1 are reference genes). Each RT-PCR control plate contains one positive control and each 
gene is measured in triplicate. Standard deviations ranged from 0.19 Cp to 0.33 Cp. The box represents the inter-quartile range, the line in the 
box represents the median and the diamond is centered at the mean. The whiskers represent minimum and maximum values observed. 



used for Breast and Colon cancer were required. More spe- 
cifically, the multiplexed preamplification step was intro- 
duced to create multiple copies of the starting RNA prior 
to quantitative assessment of gene expression. Given the 
very low yields and the resulting limitations to accurate 
measurement of extracted RNA for many samples, accept- 
ance criteria for the assay include a specification of the 
average of the reference genes rather than on mass of RNA 
input as the primary specification for sample quality. This, 
in turn, allows for the processing of variable RNA inputs, 
and samples containing as little as 5 ng of RNA have gen- 
erated reproducible results as demonstrated by the repro- 
ducibility of the individual genes and the GPS (Table 6). 
Volumes of the reverse transcription and qPCR reactions 
were reduced to preserve the genetic material, and robotic 
liquid handling was optimized for smaller volumes. A new 
genomic DNA detection assay targeting the promoter re- 
gion of one of the reference genes was created, and gDNA 
assessment performed as part of the final qPCR step. Mul- 
tiple positive and negative assay controls are included in 
every assay run to ensure consistent performing of the 
analytical process. 

The estimates of analytical precision and reproducibil- 
ity were obtained by analyzing 10 prostate tumor RNA 
samples on multiple instruments, using multiple reagent 
lots, by various operators over a period of 2 weeks. 
Standard deviation for precision for the analytical 
process was at or below 0.21 Cp for all gene assays and 
1.86 GPS units on a 100 unit scale. Reproducibility of 



Table 7 Performance of the RT-PCR positive control 



Gene Assay 


Mean 


Median 


SD 


ARFl 


22.7 


22.7 


0.20 


ATP5E 


24.3 


24.2 


0.33 


AZGPl 


24.6 


24.6 


0.23 


BGN 


23.7 


23.7 


0.22 


CLTC 


24.0 


24.0 


0.19 


C0L1A1 


22.4 


22.5 


0.25 


FAM13C1 


25.8 


25.8 


0.23 


FLNC 


25.3 


25.3 


0.24 


GPS1 


25.3 


25.3 


0.20 


GSN 


25.9 


25.9 


0.29 


GSTM2 


26.7 


26.7 


0.21 


KLK2 


19.4 


194 


0.20 


PGKl 


26.3 


26.3 


0.22 


SFRP4 


26.7 


26.7 


0.23 


SRD5A2 


28.1 


28.0 


0.32 


TPM2 


22.6 


22.6 


0.21 


TPX2 


29.6 


29.6 


0.25 



Summary statistics, including standard deviations for the RT-PCR positive 
control performance during analytical and clinical validation were calculated 
on the non-normalized scale. The plates w/ere assembled using multiple PGR 
and liquid handling robots and by multiple operators, using multiple lots of 
critical reagents spanning a period of 20 weeks (N=54 individual 
Cp measurements). 
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Figure 4 Boxplots of genomic DNA detection and qPCR positive controls. Boxplots summarizing the performance of genomic DNA 
detection and qPCR positive controls over Analytical and Clinical Validation. The box represents the inter-quartile range, the line in the box 
represents the median and the diamond is centered at the mean. The whiskers represent minimum and maximum values observed. 



the analytical process, in addition to the previously men- 
tioned sources of analytical variability, incorporated vari- 
ation in RNA input levels (i.e. 5 ng, 10 ng and 20 ng 
inputs). Reproducibility standard deviation for the ana- 
lytical process of the individual gene assays was at or 
below 0.21 Cp, and for the overall score was 2.11 GPS 
units. Given that the estimated total standard deviation 
in the GPS (including all pre-analytical, tissue related 
and biological between patient variability) is approxi- 
mately 11.4, the analytical assay variability is estimated 
to account for only 3.4% (= 100 x 2.11^ / 11.4^) of the 
total variation. In a separate study (£. Klein, manuscript 
submitted), within-block reproducibility of the GPS was 
assessed in biopsies from 46 patients (up to 4 separate 
samples per biopsy-containing block). The within block 
standard deviation for GPS in that study was 2.8 GPS 
units (95% CI, 2.5 to 3.1) indicating excellent reproduci- 
bility even when histological variability is introduced. 

Average amplification efficiency of the 17 gene assays in 
the Oncotype DX Prostate Cancer Assay was 93%, and all 
gene assays were within ±6% of that value. High analytical 
sensitivity, wide linear range (at least 10 logs) and low bias 
(under 9.7%) demonstrate that the assay is able to measure 
accurately gene expression on a wide population range 
using a very limited amount of RNA. 

The Oncotype DX Prostate Cancer Assay has been 
clinically validated [19], demonstrating that the GPS, 
assessed in diagnostic biopsy tissue, can predict the like- 
lihood of the presence of adverse pathology (high-grade 
and/or high-stage disease), and that it complements 
existing pre-treatment risk assessment tools such as PSA 



levels, Gleason Score, and clinical stage. The assay is 
intended to help guide treatment decisions in early-stage 
prostate cancer, including the decision between immedi- 
ate therapy and active surveillance. As evidence that the 
analytical assay was designed well for its intended use to 
test RNA from small biopsies, in a clinical validation 
study, valid GPS results were generated for more than 
95% of samples requiring 1 mm and 30 microns of 
tumor tissue [19]. 

Conclusion 

Optimization of the Oncotype DX platform has enabled 
the development and analytical validation of the 
Oncotype DX Prostate Cancer Assay for use with pros- 
tate biopsy specimens. This RT-PCR assay has been clin- 
ically validated to predict the risk of high grade and/or 
non-organ confined disease at radical prostatectomy 
using biopsy samples containing as little as 1 mm of 
tumor tissue. The Oncotype DX Prostate Cancer Assay 
complements traditional clinical and pathologic diagnos- 
tic features and will assist clinicians to discriminate pa- 
tients with indolent prostate cancer from aggressive 
prostate cancer to help make the most appropriate treat- 
ment decisions. 

Additional files 



Additional file 1: Oligonucleotide Sequences for each primer and 
probe. 

Additional file 2: Amplicon Sequences for 17 genes in the 
Oncotype DX GPS. 
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